
-EXTERNAL 



[TERMINAL- [4247-3 . LOG] ] 



Emhtg7:A1158158 Begin: 199213 
I A115 8158 Human DNA sequence *** SEQ... 



24.07.200Q/i 3! n . 58 



End: 
77 



199231 
299 77 



Emhtg2:Ac011126 
t ac011126 Homo s 
Emhtg6:Ac023834 
! Ac023834 Mus mu 
Emhtg2 : Ac012454 
! Ac012454 Homo 
Emhtgl :Ac008076 
! Ac008076 Homo 
Emhtg6:A1135910 
• A1135910 Human 



Strand: - 



Begin: 102317 End: 
apiens chromosome 18 . 

Begin: 88592 End: 
sculus clone RP23-36. 

Begin: 31435 End: 
sapiens clone RP11-43. 

Begin: 81374 End: 
:apiens chromosome 4, . 

Begin: 7400 End: 7415 Strand: - 
DNA sequence *** SEQ. . . 71 297 



102347 

88 
88631 

77 
31452 

75 
81390 

75 



299 

298 
Strand: 

298 
Strand: 
297 



:Ac009259 Begin: 71143 End: 71187 Strand: 
9259 Trypanosoma brucei chromos... 81 297 
Ac010729 Begin: 9984 End: 10012 Strand: 



Emhtgl 
! AcOO 

Emhtg2 _ 

! Ac010729 Homo sapiens clone RP11-35. 
Emhtg6:Ac024163 Begin: 134290 End: 
! Ac02 4163 Homo sapiens chromosome 3p. 
Emhtg2 ~ 
! AcOl 

Emhtg2 ^ 

! Ac01172 3 Homo sapiens chromosome 18. 



94 



128 



81 



76 



71 



96 



:Ac010413 
L0413 Homo 
! :Ac011723 



Begin: 177374 End: 
sapiens chromosome 5 . 
Begin: 53668 End: 



Emhtgl :Ac008367 Begin: 117968 End: 
! Ac008367 Drosophila melanogaster ch. 
WEnd of List 

9904247_3.Dna 
Emfun:Sc9718 



72 
134322 

84 
177391 
81 
53684 
76 

117995 
75 



296 
296 

Strand: 
294 



73 



84 



81 



295 



294 



76 



86 



284.0 1.7e-09 

285.4 1.9e-09 

283.2 2e-09 

283.6 2.1e-09 

282.0 2.3e-09 

282.8 2.4e-09 

283. 1 2.5e-09 

281.2 2.7e-09 

281.9 2.8e-09 
277.9 3.2e-09 

281.3 3.3e-09 

278.7 3.4e-09 



ID 
AC 
SV 
DT 
DT 
DE 
KW 
KW 

KW 
KW 

KW 
OS 

oc 
oc 

RN 
RP 
RA 
RT 
RL 
RN 
RP 
RA 
RT 
RL 
RL 

RL 
DR 
DR 

DR 
DR 
DR 
DR 
DR 



SC9718 standard; DNA; FUN; 35811 BP. 

Z49702; 271257; 
Z49702. 1 

21-MAY-1995 (Rel. 43, Created) 

ll-AUG-1997 (Rel. 52, Last updated, Version 1) 
S.cerevisiae chromosome XIII cosmid 9718. 

acetolactate synthase; aldehyde dehydrogenase; beta-transducin; 

Coiled coil protein; f olylpolyglutamate synthase; GALS; glycosyl hydrolase; 

guanine nucleotide binding protein; ILV2; membrane protein; 
myosin homologue; PGM2; phosphoglucomutase; protein kinase; SMR1; 

succinate dehydrogenase; transfer RNA-Leu; transfer RNA-Lys ; YKR2 ; YPK2 
Saccharomyces cerevisiae (baker's yeast) 

Eukaryota; Fungi; Ascomycota; Saccharomycetes; Saccharomycetales; 

Saccharomycetaceae; Saccharomyces . 

[1] 

1-35811 

Hunt S . , Bowman S . ; 

Unpublished. 
[2] 

1-35811 

Barrell B., Rajandream M.A.; 
; 

Submitted (19-MAY-1995) to the EMBL/ GenBank/DDBJ databases 

Saccharomyces cerevisiae chromosome XIII sequencing project, Sanger Centre, 

Hinxton Hall, Hinxton, Cambridge CB10 IRQ E-mail: barrell@sanaer ac nk 
SGD; S0004709; YMR103C. ««J-J- - "cirrexj.Lsanger.ac.uk 

SGD; S0004710; YPK2 . 

SGD; S0004711; PGM2 . 

SGD; S0004712; YKU80. 

SGD; S0004713; YMR107W. 

SGD; S0004714; ILV2. 

SGD; S0004715; MYOS . 
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24.07.2000/l3:ll;gft 



S0004719; 
S0004720; 
S0004721; YMR115W 
S0004722; ASCI. 
S0004723; SPC24. 
S0004724; YMR118C 



DR 


SWISS- 


•PROT; 


P07342; 


ILVB 


YEAST. 


DR 


SWISS- 


PROT; 


P18961; 


YPK2~ 


"yeast . 


DR 


SWISS- 


PROT; 


P37012; 


PGM2~ 


"yeast . 


DR 


SWISS- 


PROT; 


P38011; 


GBLP" 


~YEAST . 


DR 


SWISS- 


PROT; 


P54074; 


YM08" 


"yeast. 


DR 


SWISS- 


PROT; 


Q03177; 


YMZ2" 


"yeast . 


DR 


SWISS- 


PROT; 


Q04436; 


YMZ3~ 


"yeast . 


DR 


SWISS- 


PROT; 


Q04437; 


HDF2 


YEAST. 



SCORES 
99.8% 



Initl: 2470 Initn: 2470 Opt: 2476 z-score: 2560.5 E() 
identity in 497 bp overlap 



9904247_ 
Sc9718 

9904247 
Sc9718 

9904247_ 
Sc9718 

9904247_ 
Sc9718 

9904247 
Sc9718 



*10 20 30 

- 3 * Dn GCAGAAAT GAT GAAGGGT GTTAGC GC C GT C 

I I I 1 I i I I I I ! ! I I I i I I I I 1 M I I I I I I I 

AAAAGT CAT CT GGC AC GTTTAATT AT CAGAGCAGAAAT GAT GAAGGGT GTTAGC GC C GT C 
11680 11690 11700 11710 11720 11730 

40 50 60 70 80 90 

_3 . Dn C ACT GAT GT GC CT GGT AGT CAT GAT T TAC GT AT AACTAACAC AT CAT GAGGAC GGC GGC G 

> 1 ' > I I i I j I I I I I I t 1 ! I I I I I I I I I I I i | | | | | { | | | | | M | f | | j | | | | ] , , | , , , | 

CACT GAT GT GCCT GGT AGT CAT GAT T T AC GT AT AACTAACAC AT CAT GAGGAC GGC GGC G 
11740 11750 11760 11770 11780 11790 

100 HO 120 130 140 150 

_3 . Dn T CAC C C CAAC GCAAAAGAGT GACT T C C CT GC GCTTT GC CAAAACCC CAT AC AT C GC CAT C 

I f I I M I I E I I I I I I I I I I i I I I I 1 I I I I I I j | | | | I I I I I I I E I I I I I I I I I I I I ! I I I 

T CACCC CAAC GCAAAAGAGT GACTT C C CT GC GCT T T GCCAAAAC CC CAT ACATC GC CATC 
11800 11810 11820 11830 11840 11850 

180 190 200 210 



160 



170 



3 . Dn T GGCT C CT GGCAGGGC GGT T GAT GGACAT C AGC C GC CT C C CT TAATT GCTAAAGC CTC CA 

> ' M E I I I E I I I I I I | | | | | | I | | | | | I I I I I I M I f I I f I I I ! | | | | I I I f I I I I f I I | 
T GGCT C CT GGCAGGGC GGT T GAT GGACAT CAGC C GC CT C C CT TAATT GCTAAAGC CT C CA 
11860 11870 11880 11890 11900 11910 

22 0 230 240 250 260 270 

_3 . Dn CAAGGCACAATTAAGCAATATTTCGGGAAAGTACACCAGTCAGTTTGCGCTTTTATGACT 

I I E I I I I IN I I I I I 1 I I I I I I I I I I I 1 I I I M | | f | | I | | I I I I I I I I I I I I I I I I I I ! 
CAAGGCACAATTAAGCAATATTTCGGGAAAGTACACCAGTCAGTTTGCGCTTTTATGACT 
11920 11930 11940 11950 11960 11970 



280 290 300 310 320 330 

9904247_3 . Dn GGGTT CTAAGGTACTAGAT GTGAAGTAGT GGT GACAGAATCAGGGAGATAAGAGGGAGCA 

1 < I f I I I I I I I I I I I I i I I I I f I I I I I I I I I | | | 1 | | | | | | I I I I I I I I I f 1 I | | | | M i 
GGGTT CTAAGGTACTAGAT GT GAAGTAGT GGT GACAGAAT CAGGGAGATAAGAGGGAGCA 
11980 11990 12000 12010 12020 



Sc9718 



12030 



340 



350 360 370 380 390 

9904247_3 . Dn GGGT GGGGTAATGAT GT GC GATAACAAT CTT GCTT GGCTAAT CAC C C C CAT AT CT T GT AG 

1 1 1 1 1 1 1 1 1 ' I I I E I I M M I I I I I I I | M M I 1 f I I I f | | I I I 1 1 I I I 1 I M I I I I M I 
GGGT GGGGTAATGAT GT GC GATAACAAT CTT GCTT GGCTAAT CACC C C CAT AT CTT GTAG 



SC9718 



12040 



12050 



12060 



12070 



12080 



12090 
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g^o gOR-EXTEBKAL [TERMINAL- [4247-3 . LOG] j 24 . 07 . 2000/13 : 11 ; 58 

400 410 420 430 . 440 450 

9904247_3 . Dn T GAGT ATATAAATAGGAGCCT CCCTT CCTATTGCAACT CCATAAAATTTTT TTTTGTAGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I If I I I I I 
SC9718 T GAGT ATATAAATAGGAGCCT CCCTT C CTATT GCAACT C CATAAAATTTTTTTTT GTAGC 

12100 12110 12120 12130 12140 12150 

460 470 480 490 

990424 7__3 . Dn CACTTCTGTAACAAGATAAATAAAACCAACTAATCGAGATATCACAT 
I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || 

SC9718 CACTTCTGTAACAAGATAAATAAAACCAACTAATCGAGATATCAAATATGGGTAGTTTTT 
12160 12170 12180 12190 12200 12210 

SC9718 GGGACGCATT CGCAGTATACGACAAGAAAAAGCACGCAGATCCAAGT GTATATGGAGGAA 

12220 12230 12240 12250 12260 12270 

9904247_3 .Dna /rev 
Emhtg6:Ac023173 

ID AC023173 standard; DNA; HTG; 191479 BP. 
AC AC023173; 
SV AC023173.1 

DT 14-FEB-2000 (Rel. 62, Created) 

DT 14-FEB-2000 (Rel. 62, Last updated, Version 1) 

DE Mus musculus chromosome 6 clone RP23-11G22 strain C57BL6/J, WORKING 
DE DRAFT SEQUENCE, 11 unordered pieces. 
KW HTG; HTGS_DRAFT; HTGS_PHASE1. 

OS Mus musculus (house mouse) 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Teleostomi; 

OC Euteleostomi; Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; 

OC Mus . 

RN [1] 

RP 1-191479 

RA Beckstrom-Sternberg S.M., Benjamin B., Bouffard G.G., Dietrich N.L., 

RA Eagle W.O., Gan W. , Gupta J., Ho S.-L., Huang .M.C., Idol J., Jamison D.C., 

RA Lee-Lin S.-Q., Maduro Q.L., Maduro V.B., Mastrian S.D., McCloskey J.C., 

RA Morse E., Ojodu M.A., Pearson R., Stantripop S., Summers T.J., Thomas J.W., 

RA Thomas P.J., Tiongson E.E., Touchman J.W., Tran J.T., Vogt J.L., 
RA Walker M.A. , Wetherby K.D., Green E.D.; 

RT "NISC Mouse Sequencing Initiative"; 

RL Unpublished. 

RN [2] 

RP 1-191479 

RA Green E.D. ; 

RT 

RL Submitted ( 09-FEB-2000) to the EMBL/ GenBank/DDBJ databases. 

RL NIH Intramural Sequencing Center, 8717 Grovemont Circle, Gaithersburg, MD 

RL 20877, USA 

CC — Genome Center 

CC Center: NIH Intramural Sequencing Center 

CC Center code: NISC 

CC Web site: http://www.nisc.nih.gov 

CC Contact: nisc_mouse@nhgri.nih.gov 

CC — Project Information 

CC Center project name: ya 

CC Center clone name: 011G22 

CC — Summary Statistics 

CC Sequencing vector: plasmid; n/a; 100% of reads 

CC Chemistry: Dye-terminator Big Dye; 100% of reads 

CC Assembly program: Phrap; version 0.990319 

CC Consensus quality: 184222 bases at least Q40 

CC Consensus quality: 187020 bases at least Q30 

CC Consensus quality: 188777 bases at least Q20 

CC Insert size: 180000; pulse- field-gel 
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